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INTRODUCTION 


0,1 Scientific Structure of the Artificial Intelligence Laboratory 


The- activities of the Artificial Intelligence Laboratory can be 
viewed under three main aspects: 

1, Artificial Intelligence: Understanding the principles of 
making intelligent machines along the lines discussed irt 
previous proposals* and elaborated below, 

2.. Natural Intelligence: As we understand Intelligence better 
we see fewer differences between the problems of under¬ 
standing human and machine intelligence. 

Me have been 1ncreasingly able to translate our Ideas about 
programing machines into ideas about educating children, and are 
currently developing systematic methods In elementary education* And 
conversely,, we attribute to our observations and experience in the 
Tatter activities much of what we believe are important new conceptions 
of how to organize knew ledge for programs that really understand. 

3, Mathematical Theories: This aspect is relevant not only 
because we often need to solve specific mathematical 
problems but especially because we are firmly committed 
to maintaining a mathematical style in the laboratory. 

In many centers we have seen decline and deterioration 
following an apparently successful “experiment" in 
artificial intelligence because the principles behind 
the performance were not understood, hence the limita¬ 
tions unseen. 
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Organization of, the Laboratory 

Organizationally our laboratory "is divided into a number of 
groups Mr'S tin a certain degree of overlap (both in people and in Interest}. 

VISION? prograuinl ng machfnes to see 
ROBOTICS? programmlng mechanical manipulations 
LANGUAGE? programs that understand English 
PLANNER: iirplementation of a new programming language 
PROGRAM-UNDERSTANDING: programs that understand processes 
MATHEMATICS: especially schemata* complexity* theory of 
computation 

MATHLAB: (In cooperation with the group In Project MAC} 
"l.TiS.": The A.I. Lab's time-shared computer system 

DEVELOPMENTAL PSYCHOLOGY: the educational research group (N5F) 

HARDWARE: experimental shop facility 

ADMINISTRATION. 

For the past year, the Artificial Intelligence Laboratory has 
been an independent M.I.T* Laboratory, separate from Project MAC* 

(The giroup was started In 1959 as part of the Research Laboratory of 
Electronics and the M*L*T+ Computation Center and later became a group 
of Project MAC, It has grown to a size and complexity that now needs 
its own administration*) The funding Is mainly from ARPA* the studies 
in Education and Child Development are supported by the NSF* and there 
are some smaller support contributions from NASA, NIH, and from some 
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spurees of fellowships and assistant ships, 

In the area Of ROBOTICS, we hive come to the conclusion that 
the research directions that have been followed, while extremely 
productive for Artificial Intelligence research, have not been effective 
enough In stimulating practical deyeloptnents in the area of advanced 
automation. Accordingly, we have proposed: to ARP A* In a separate 
Supplementary letter, to consolidate what we new know in the design 
of a complete and easily copied mini-robotics laboratory system, as a 
packaged system* so that other tenters can get started on applied 
research in that area. 
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1, Progress in the Hand-Eye Project 

Progress in our laboratory can be seen on different levels* 

Very specific projects* such as the hand-eye work supported by AftPA. 
achieve very specific and Increasingly ambitious goals. Several years 
ago we were struggling with the problem, of making a computer see any- 
thing at all* and were proud of the first demonstration In which the 
computer could see a visually Isolated, clear, well-fl Tuminated 
shadow-free cube well enough to add It to a tower, last year our 
seeing machine was able to Took at a coirfilex of mutually obscuring 
objects, analyze the scene visually, dismantle ft and put it together 
again in mirror image. 

The first scene copied was as shown below. Note 
that objects obscure one another by way of support 
and in-front-of relationships. The objects are not 
of any particular sizes both Site and positfon are 
determined in the copying effort as required- 



Since then we have Increased Its ability to tolerate less than perfect 
lighting conditions* see shadows, deal with even more obscured Objects 
and sq on. 

But, our progress appears as quantftatfveTy continuous only 
when viewed very superficially, A deeper examination shows qualitative 
changes In the methodology of the work. In the kind of question we ask 
and answer, and above all in the kind of theoretical concluslons_ we 
draw from it. To define the trend of our work and, particularly, to 
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place next year's work In its proper cortte xt T we need to rev few our 
progress on these more fundamental levels. 

So, let us review briefly the progress in vision* this time 
examining the style of operation of the programs rather than their 
Outer behavior. 

Seeing and picking up an isolated object under good conditions 
does rot actually need any of the techniques of programing or 
theoretical concepts we see as characteristic of "artificial -intelligence", 
It car be handled as an interesting* difficult, but typical problem 
In classical programing and engineering. 

The main Initial direction forward from the simple case in 
our laboratory was the development of what we now call "mini-theories" 
of Important visual phenomena, Some of these concerned problems of an 
optical character, such as the light distribution at the edge of an 
object, the deduction of surface curvature from shading* the local 
structure of shadow lines and so on, Others were of a geometric 
character, such as the classification of vertices and edges into 
regions, the dissection of scenes into separate objects„ defining 
relations between objects In three dimensional space. 

Me consider our programs as belonging to Artificial Intelligence 
from the time they begin tq use such mini-theories. Eut merely using 
them is only a beginning: the truly deep questions are connected 
with how they are used. And the major focus of our work has turned 
toward defining and understanding phenomena of interaction of diverse 
kinds of know1 edge T 
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An early example of such interaction in the vision 
system occurred in the process of locating objects 
In space. One straightforward method using focus 
was used with a more complicated module that depends 
on heurtsttcally derived conjectures about how objects 
in the scene support each other, The focus method 
Is less accurate, but is immune to the grass blunders 
occasionally made by the support-dependent module. 

When used together one can get both great ret I ability 
and great precision. 

The simplest way to use mini-theories such as those Tisted above 
Is separately , for example 1 rt a pass-oriented or hierarchical program 
structure where 11 optic 11 mini-theories might be used for line detection, 
geometric mini-thepries for parsing scenes and so on. There is no 
doubt that useful results can be obtained this way and we have demon¬ 
strated some; but It is now dear that the real pay-off Is in a different, 
ultimately more powerful direction, illustrated by the Idea of heter¬ 
archical program structure. 

A fair image of what we mean by this can be obtained by thinking 
of ways In which human specialists might be Integrated Into a large 
organization. The pas5-oriented model uses the following pattern for 
solving a problem: the problem is divided in advance Into stages 
or sub-problems, which are p-assed out to the specialists; each special¬ 
ist sees his part of the problem, does what he tan with it, passes 
on the result and is through. Heterarchical organization Implies the 
possibility of eonminication between specialists' consultations, 
sending the problem back for further study and so on. 

The advantages and dlls advantages of each mode of organization 
are plain enough; the first structure lends itself to orderly work, 
but suffers from rigidity and if there is any flaw In the original 
plan it will fail at that point, The second structure is much more 
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powerful QF1 condition that the process does not full Into chaotic 
disorder . Experience of human organizations shows that a free inter¬ 
action can work only if somewhere in the background Is an infra¬ 
structure of conventions and understandings. This Is equally true 
of program organization: one of our central technical problems is 
how to set up such organtjatfonal infra-structures. 

During the past year it has become clear that our conceptual 
and technical tools for this task have reached a critical mass. A 
new style of programming and thinking about programs for Artificial 

Intelligence has almost explosively spread through our own laboratory 

* 

and has even already begun to affect the work at other centers of 
research, The first major example of programing in the new style 
was T. Winograd's program discussed In last year's proposal to ARM, 
Since then* as we anticipated In that proposal, our Vision System 
has been recast Into the new form and several new projects have 
adopted it from the outset. The following section contains a summary 
statement of the main features of these new programs, A more detailed 
account can be found in our forthcoming Progress Report (available 
19-71) ■ which has taken the form of a monograph on the funda¬ 
mental problems of Artificial Intelligence (and Is therefore a piece 
of progress in Its own right as well as a report on this and other 
progress), We do want to emphasize our belief that what we are doing 
really is quite different from the approach of Artificial Intelligence 
generally followed up to now In other centers and expressed in the 
recent round of bonks on the subject. We see the difference as funda¬ 
mentally this: faced with the apparent diversity of kinds of knowledge* 





the conmon approach at other centers Is to seek ways to render it 
more uniform so that very general * "logically clear" methods can be 
used; our approach is to accept the diversity of knowledge as real 
and inevitable, and find ways to manage diversity rather than eliminate 
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Zv Kfiowl edge and. Common Sense 

An exciting aspect of the new programming style is that it 
promises at last to break: up the stereotype of the computer's slavish 
dullness, or "superspeed moron" character. A few years ago, the remarks 
of the previous section would have been appropriate to describing the 
differences between men and computer programs; now they contrast the 
new with the eld program! A good example of a degree of "coin™n-sense 11 
is supplied by the interactions of the programs in the "Blocks World JI 
system that is responsible for the behavior in the attached dialog -- 
[Appendix). The syntactic and semantic systems generate construction 
goals* like "Build a steeple, 11 The definition of the goal ("a steeple 
is a stack which contains two cubes and a pyramid 11 ) is converted into 
a plan -- a step-by-step specification of subgoals — that Is Interpreted 
as a program for building a steeple? find a cube; put another cube 
on it, put a pyramid on that. Now when a first cube is found. 

Its top might be cluttered with other objects. We do not want the 
robot -- like the assembly-machine in Chaplin's Modern Times — to 
smash the second cube down willy-nilly, But we don't want, either* 
to have to write into our plan: "find a first cube and remove the 
things on it" because (1) it is so obvious to say it and {£) it only 
sweeps the problem under the rug: the program will have to find a 
clear surface somewhere to put down the junk it takes off the first 
cube. 

W1nograd's system faces the problem once and for all. An 
almost autonomous network of statements and procedures "know" that 
to put one thing on another there must be a place it will fit. If 
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there is no Such place One must make one (or faff). To (flake a place 
one must move something else, and (recursively} put it somewhere else 
where j^t will fit, etc. But one cannot put something on itself* even 
if there is room, because it wqn K t be where it was at the future moment 
of setting it down after moving it, etc, etc. Now* instead of writing 
this sort Of thing into each particular application program* we create 
once and for all a "micro-world" of knowledge about how things are 
supported, how supports change when things are moved* and so forth. 

This knowledge is Invoked by the occurrence of patterns either in the 
outer world, or In goal-statements, whenever they occur, and the 
common-sense processes intervene and take over the actions until their 
invoking patterns disappear. For example} there Is In the micro-world 1 
a statement whose effect is 

"If A Is supported by e» and A is moved* then erase 
from the current descriptionof the situation! any 
statement of the form 'fi supports A.?" 

Notice the indirect character of this. It is a statement not about 
the physics of support but about when to forget statements concerning 
support! (When you paint an object* and then move it. analogous 
statements should not disappear,} Due could conceivably do without 
this advice, at the cost of recomputing: after each change In the world 
all the relations between all subsets of objects. This is impractical 
and is a common cause of examples in which a system works on "toy 
problems" and collapses on real problems. Or one could do without this 
advice* at the cost of recomputing after each change In the world all 
the logical consequences of that change; this leads in a different way 
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to collapse. One cart make heuristic compromises: motions change 
geometrical relations hut not (usually) other attributes of objects. 

The art and science, eventually here is in finding what are 
the points that are so inrnediate that One should know them directly* 
and which can be left to more general but more laborious deductive 
systems - 

Conmon sense is not magic. If we want our computer to act as 
though it knows the elementary strategies about physics and geometry* 
we must give It that knowledge somehow. But we need not do this anew 
for each program! So Oyr goal is to learn how to refine the Ideas 
in the Blocks World* and the ideas in pattern-matching invocations that 
make this knowledge engage relevant situations* so that we can keep 
this u data“ permanently in the system. Then apy program written therein 
will automatically behave sensibly in that sphere of activity* 
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3, Some Features of th# Mew Prog ramni rig Style 

Ca} Use of the language PLANNER 

Programming languages such as ALGOL are designed to make 
ft easy to express the kind of statement certain classes 
of users are likely to require: for example, algebraic 
expressions and repetitive loop structures, in program¬ 
ming for Artificial Intelligence we need to- express very 
easily such Instructions i 

To achieve Goal A, set up 
sub-goal B arid If this 
falls* try C or 0, 

PLAINER Is a language designed to be highly expressive 
in talking about just this kind Of advice. 


Example : Art example that illustrates PLANNER'S expres¬ 
sive power is that a graduate student, Ira Goldstein, 
was able to write a PLANNER program rather like the 
old Gelemter Geometrical-Theorem-Prover — but not 
only did Goldstein need only a few days of work* cnee 
made, his program could easily be modified to handle 
classes of proof involving constructions which Gelernter 
was unable to do in a Tong period of work, This 
ability to p-erform experiments in Artificial Intelligence 
rapidly and flexibly is a most decisive change in the 
style and fertility of work. 


(b) Automatic Mechanise for Fallible, and Contingent Instructions 

The Image of a classical numerical program Is as a sequence 
of actions performed one after the ether and expected to 
succeed. In Artificial Intelligence, programming an action 
might be an attempt that fails. In this case it is necessary 
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to ensure that when it gives up, the micro-world In 
which it works Is not left untimely cluttered. The 
PLANNER system contains powerful mechanisms to take 
care of such situations. 

Exattpl o : The examples already cited of how the HI nograd 
program bangles its Blocks Words Is a typical case. If 
the proposed action Includes a statement-erasure the 
failure back-up car restore the description: to its for¬ 
mer state with, perhaps, some editing relevant to the 
failure. 

(c) Procedural Description of Knowledge 

It is slightly ironic that the most popular approaches 
to Artificial Intelligence program!ng force knowledge 
to be stated im a fern of "logic" as assertive proposi¬ 
tions such as: 

The Box Is Red* 

flow* there is no problem hero, in simple statements 
about attributes. But other kinds of knowledge are 
much better stated as procedures [or "programs") 
rather than as facts * Even such a simple statement 
as: 

"Lf there are no cars coming, cross the road." 
Is misleading if translated into a logical implication 
such as: 

For every x, [{x Is car) S (x is not coming)] 

~—■/ [crossing is permissible]< 

If is more naturally transcribed as: 
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'U&k left, look right, if you hadn't seen 
a car gross. " 

The distinction between these modes of expression Is not 
merely verbal. A deeper aspect is seen by picturing a 
"logical theorem proven" trying to prove by resolution 
or other logical principles that no car Is caningJ 
Deeper yet are the consequences in more cwiplex situations 
we belteve that even In the size of programs we are now 
using they make a difference between easy programming and 
very difficult programing, With more complexity we 
believe that the difference can become one of possible 
versus impcsslble. 


(d) An Eaample of Procedural Hecess1tv 

The concept of "nearness" is a good example showing 
procedural definitions to represent knowledge, Everyone 
knows what "near* means, If we are told that 


The car Is near the garage, 
and 

The garage is near the house 


then we can be sure that the car is near the house, But 
we cannot put this "transitivity" Into a formal logfcal 
system by a rule like 

Rule 1 : {A near B) and (B near 0=^ [A near C) 


for unrestricted application of this rule would yield 
absurdities like “l is near 100" because 1 Is near 







1.001 andl.QQT is near 1,002-, etc., etc.l 

It Is clear what Is needed* In any particular 
context, "NEAR" Is used to represent a certain size 
range, and If one uses eh a f ns of longer than, a very 
few' steps, one may get out of that range. The obvious 
thing to do Is to add 

Rule 2 : Don't use Rule 1 more than [say) four times, 
unless there is so™ basis for believing that you 
are still In the same size range* 


This cannot be said In any ordinary "logical system"! 

No system of "mathematical logic" allows statements 
Inside the logic to talk about the deductive process 
that uses the logic* This, we claim, is disastrous 
for Intelligent systems because In solving a hard problem 
one must devote much attention to monitoring and 
planning the problem-solving activity! We can state 
Rule 2, or the equivalent* in the PLANNER language. 

(The only earlier system we know of in which one could 
do this was Tettelman's PILOT.) 

Meta^technfcal Remark 

This'TttYtucfe’ is not shared by most ether groups 
working on artificial intelligence, and we feel that 
the widespread committment to try to represent ordinary 
reasoning in terms of a "consistent" mathematical logic 
system is having almost as bad an effect as was the 
earlier preoccupation with perceptrons and linear- 
separation clustering algorittims — another kind of 
attempt to find a uniform way to represent all different 
kinds of things. 

An interesting sidelight on this Is the phenomenon 
in which many people Interpret Godel's theorem as 
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showing a difference between men and michlnes. 

It does not* What it says is that any system which 
is able to discuss its own procedures, and apply 
these to itself, has the potentiality of deducing 
some falsehoods* It is perfectly possible to 
program computers to be able to discuss their own 
procedures — W1 no-grad 1 s program comes close to 
this — and we believe that this Is the best 
path toward Intelligent programs; "Rule 2" Is a 
Simple Instance, We do not believe that enough is 
known, today, to make worthwhile the search for 
an adequate and '"consistent" logical intelligent 
system [that is, one that is inherently unable 
to have an^ self-contradictions’)* This has never 
been done even for ordinary mathematical arguments, 
to say nettling of everyday common sense* 

What men do is much more PLANfO-like, 

Suppose that you deduce a contradiction In some 
argument or find that some plan you expected to 
work out did not* One then looks backwards and 
tries to "localize" the trouble* Then one makes 
a pattern-invoked heuristic rule, to prevent that 
kind of deduction being made again* '"Rule 2'" is 
just such a device* A superb non-trlvial example 
is provided by the way real mathematicians deal 
with "naive set theory"* They do not reject ft* 
as did Bertrand Russell in his attempt to rebuild 
mathematics without it* Instead, th^y now have 
rules like: if you-r statement resembles Russell's 
Paradox (because it talks about itself) then be 
careful; find another way to do it. It is astonish 
ing how few such caveats have been adequate to 
keep contradictions out of ordinary raatihematlcs* 

And we believe that among the most Important forms 
of human knowledge are just such rules that indi¬ 
cate which lines of thinking are unsound * Another 
example: whenever One finds a theoretical outlook 
which can explain things "too easily'", as do 
mystical concepts of "unity in everything" Or the 
dialectical elements in Freud's Theory {In which 
many causes can produce opposite effects) one says 
to himself (unless he is still adolescent): "This 
method is too good. That means the inconsistency 
is too close to the purposes I want tt for, 11 
Then one tries to build op protective knowledge 
structures for preserving what one can (Freud 
proposes many valuable new ideas about how know¬ 
ledge is represented — and misrepresented!) of 
the new theory. 

In taking this path (and we believe It Is 
the only promising approach, today* to under¬ 
standing intelligence), we must be very clear 
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abeut the risks. We must understand 1 that as our pro¬ 
grams get better ait analyzing their own processes * and 
incorporate better heuristics for preventing unsound 
kinds of reasoning from emerging, the possibilities 
of contradictions become buried more deeply but are 
not eliminated, We must not be entirely diverted by 
good empirical results. We must not give up on trying 
to get a complete theory. It is certainly not incon¬ 
ceivable that we can construct a systematic theory of 
Cdisistenty in a logical area wide enough to cover 
all important areas of intelligence* one must be on 
guard against other unwarranted (but common-sensei] 
pessimistic misinterpretations of Godel-like theories. 


[e] Mini “theory Construction as a Technical &oal 

He do not see the problem of Artificial intelligence as 
one of programming existing knowledge ; it also involves 
the acquisition and classification of new substantive 
knowledge about such areas as; intentions; excuses; 
goal structures and so on. A curious feature is that 
such enquiry Insofar as It has been conducted at all 
in the past* has been the domain of analytical philosophers 
and literary critics. Their analyses were* however* 
limited by their lack of computational models* so we 
are able to go further. Nevertheless* doing so depends 
on the acquisition, by people working on Artificial 
Intelligence* of sophistication in areas of thinking 
far removed from the content of "computer science" courses. 

Example ; To Illustrate the point we give an example from 
E, Cnarirlak L s work on making programs understand narra¬ 
tive, This work was mentioned in last year's proposal, 
and has recently reached the level of an operational 
program capable of answering questions such as In the 
following example. In the left column Is a narrative 





taken from a children's reader, (In Its present fern 
the program interacts in a special format; tie have 
translated this into English, for simplicity of reading.} 


STORY 


Jack and Janet are in the 
house. Jack is holding a box 
of pencils and a box of paints. 

"Janet* see the paints and 
pencils that Daddy get for us, 1 ' 
Jack said. 

Janet went to look at them. 

"Are the paints for me?" 
she asked, 

"No* the paints are mine/ 
said Jack, "The pencils are 
for you* Janet/ 

Janet said to" herself*" I 
want the paints.* 

Jack began to paint a pic¬ 
ture of a read airplane. Janet 
went to look at it. 

"Those paints make your air¬ 
plane look funny," she said., 
"You could Hake a good picture 
of a red airplane with these 
pencils," 


Question: Why did Janet say that 
the paints were bad? 

Answer: She wants the paints. 

Comments: The program must interpret 
"funny" as “bad" in this 
context (This Is not done 
for it in the input format]. 
Even then Janet really said 
that the picture was had, 
and it is necessary to 
transfer this to the paints, 
and then It must know that 
if you want something another 
person has* you might make 
nasty comments about it in 
ordfir to get it, If we had 
asked, Is the picture funny* 
the response would have been 
[in essence} "Mo* she said 
so tut she had an ulterior 
motive,* To do this the 
program needs a lot of infor¬ 
mation about wanting* trading* 
giving* owning, statements can 
not be taken at face value — 
translated Into simple logical 
statements. They must be 
treated as evidence for the 
program to use to build a 
model of what really might 
be happening. 
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Spetiflc Directions for 1972 

(a) Vision 

In lad year's proposaT the main goal for vision was creating 
a new heterarchical system In PLANNER. This was done by a team con 
listing of P. Winston* B, Korn and E. Fneuder. The new system 
quickly proved itself by performing a "copy demonstration" In which 
assemblies of blocks were analyzed, taken apart and re-assembled in 
mirror Image! This represents a very definite advance on the state 
of this art. 


This demonstration Is significantly different from 
aqy we have made before in Its generality. The 
earlier demonstrations illustrated techniques that 
We could (and often later did) use in other contexts. 

But the demonstration itself was usually rather 
rigidly unchangeable. The new vision system can be 
interfaced as It stands to almost any task. . * * 
buildfng towers, putting blocks into boxes etc* etc* 

Of course It has limitations and needs improvement* 

But It does not need to be reprogrammed to be used 
for a very general class of othe-r tasks. 

The weakness of the new vision system is its failure to make 
effective use of all the heterarchical capabilities built fnto it. 
Indeed at the moment it works as if it were a series of separate 
heterarchical programs. But since the facilities for greater inter¬ 
action are there* it will make continuous progress. 

The most Immediate weakness of this sort Is a bottleneck 
of eonmunlcation between finding lines and Interpreting 
scenes. Work Is currently focussed on removing this. 

Curing the next year the system will be extended In the following 


directions; 






[a) A series of new "mini-theories" to enable the 
system to use {rather than be bothered by!} 

Shadow lines as a source of information. 

{b} More interactive line-finding. 

[c) Other representations of objects than as sets of 
lines. 

The possibility of multiple forms 
of representation for the basic 
elements of the system Is the 
consequence of heterarchy and one 
of the more exciting sources of 
theoretically interesting problems. 

C<0 Range-finding. 

fe} The ability to confine its analysis to aspects of the 
scene with a high degree of relevance to the immediate 
question. At the moment the program still collects and 
analyzes much more information than It needs* 

The following situation illustrates whit we have In mind here- 



If the machine wants to pick up object A, -ft soon realizes that its 
position-!ocating module first requies some information about the 
size and position of B. Consequently B is examined as a result of a 
goal to move A, but C Is properly Ignored as irrelevant to the goal. 
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(b) Robotics 

We shall concentrate on problems described In our separate 

Tetter on mini-robotic 5 . 

(c) Language 

There are a number of new areas that demand Investigation; 

What are the problems In handling other 
aspects of ordinary graranar? 

What new primitives Kill be needed as 
new words are added, etc. 

These problems are Inseparable from those that arfse from adding a 
wider range of "meanings 11 to the entire semantic system. What happens 
when we try to extend the system beyond those problems that it now 
can handle with respect to the blocks world? There are alternative 
concepts of how to proceed, and let us consider two extremes: 

One approach Is to add to the blocks world incrementally. It 

1 

Is easy to add new kinds of objects* and properties for them* to 
the syntactic-semantic-problem solving complex. 

It is somewhat harder to add new predicates about spatial relations 
for example, "NEKT-TO" might be important In some problems. But such 
an increment means that one must also add new procedures for taking 
account of such new elements* be they mentions* explicitly In a main 
goal derived from a natural language command* or arise internally in a 
description invoked by an already present theorem. The new procedures 
represent ways to solve problems and understand situations, but they 
cannot efficiently be used unless "recommendations" are added to the 
theorems of the older knowledge base so that the new knowledge can be 
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invoked when (and only when) relevant. Recommendations pointed the 
other way are needed, too, and modifications most be made to the 
Meanings" of appropriate words so that the syntactic and semantic systems 
can handle the nuances associated with the wider spectrum of meanings that 
the system is now required to deal with- 

Presumably! as this kind of incremental extension is made, some 
changes will be easy — whenever the system does something that 11 it 
should know better than to do* the programmer can intervene and attempt 
to adjoin new "advice" as a new theorem, recommendation„ or entry in the 
dictionary, or fragment of PRQGRAW1AR or PLANNER program. But sometimes 
this will he found very difficult, because instead of a small addition 
or change the system will want a new kind of data-structure* or a new 
heuristic strategy for achieving a new kind of goal. 

For example, U1npgrad l s Blocks World dees not have a variety of 
“uses" for the mechanical structures It builds. In particular, it does 
not have any concept* at present, of multi pie-support-upwards, It can 
deal with situations In which one object supports many others ( as several 
stacks on a cube on a table, hut It cannot handle such structures as 
bridges and arches f« which one object ts supported: by several. When 
a child builds a high tower that turns out to be unstable, he has ideas 
about "reinforcing 11 if by providing multiple support to lower elements. 

How hard Is it to add such concepts? He do not really know, yet. 
But It seems clear that It would be hard to do "incrementally* because 
the data-structure In the original blocks world assumed single support, 
and all the ’’theorems 11 about construction and Interference between goals 
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are written In terms of this single support. 

Another approach opposed to the incremental would be based on a 
concept of "microworlds" (referring to suborganiiations of knowledge* not 
to physically separable parts]. The original "blocks World" Is an elegant 
nitni-theory which, by itself,, is a highly satisfactory model of certain 
kinds of interactions between purposes and physical relations. Is there 
no way to preserve its effectiveness* intact, in a larger system? 

If the new area of meaning were very different It would be much 
more clear what to do. If, for example* we wanted to talk about what the 
objects of the blocks world were used for, (boxes are used to store things 
away that one does not expect to need soon, towers are used to make high 
structures* pyramids aren't used for anything, etc.) we would have little 
trouble. He could build a different micro-world about block-structures 
and their uses, and procedures for designing structures we needed for 
different purposes, and then turn to the blocks world to find how to make 
the structures. Presumably* ft would not be excessively hard to add to 
the system a collection of theorems and recoanendatlons that would serve 
to tie the two separate micro-worlds into a system that could solve 
problems that need both kinds of knowledge. 

Even this has not yet been done, however* So one of our goals is 
to accumulate experience in finding ways to interconnect two coir^aratively 
separate micro-worlds, another is to get experience in the kinds of 
problems involved in extending an existing micro-world. The first 
experience is* we predict, somewhat more valuable, for if it can properly 
be done, the interaction advice may be able to survive the extension of the 
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sub-microworlds involved. 

In any ease* ft Is our conviction that there is no plausible 
alternative to this idea of structuring knowledge into reasonably coherent 
packages, 

£d} PLANNER 

Work is in progress on further and better implementation of 
the language. The limited version known as "micro-planner" 
is being used in several other centers. We like this, of 
course, but feel concerned lest a restricted form of the 
language become too well established* and will take what 
steps we can to make the best possible form generally 
ay it liable,, In particular, we are concerned to make the 

language available over the AfiPA net. 

(e) Program-Understanding Programs 

This is an area in which rapid growth fs very Ifkety. It 
Is closely related to the Natural Language project and to 
the structure of PLANNER, The explicit goal is to write 
programs capable of understanding programs. 

In addition to the central Issues described before, we hope 
over the next year to go further into systems that understand, in various 
senses, more about processes. There are a number of different aspects 
of this general goal that arise over and over again in work on artificial 
tntel1igence and, indeed, in many branches of computer science. For 
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example: 


Compilers do not have much idea &f t 
they are doing, Programs are compiled 
without any sense of the Intention of 
the source program, and the resulting 
code is produced without any awareness 
of imputations of the specified 
process. 

Operating systems do not know what they 
are doing, either. They are dealing 
directly with the execution of processes 
but have no semantic model of what is 
Involved; 

Learning programs like that of Winston produce descriptions of 
structures from a sequence of examples that have been presented. Much 
of learning Involves the acquisition of new processes. We do not foe- 
Ifeve that there is necessarily a large difference between acquiring 
descriptions that represent the structures of objects and descriptions 
that represent the specification of procedures. &ut If we are to be able 
to do the latter, we experience with problem-solving systems that deal 
directly with the semantics of programs, and this experience is generally 
lacking. 

More generally, in order for a learning program to be versatile, 
ft has to be able to analyze procedures that it learns, to adapt them 
to new situations, to debug them, etc. We believe that human children 
who are not abTo to cope with complicated situations are that way because 
their process-understanding capability is inadequate. 

We are considering several approaches to developing competence in 
this area of procedure understanding programs. 


1. Carl Hewitt has been developing a formalism, called INTENDER, 
in whlcb one can associate with definitions of procedures semantic state¬ 
ments that make assertions about the effects that the procedural state¬ 
ments are supposed to have. In a sense, this Is a sort of formaliiatiOn 
of the semantic clues programmers frequently leave In the form of 
mnemonics and comnents. There are a variety of ways to use these statements, 
ranging from proving that the procedures will have the Intended effects 

to making the procedures adaptable to use by larger systems that can 
understand and change the code,, or translate It Into other source languages. 

2 . Hewitt has also developed a system for ‘'Procedural Abstraction, 1 ' 
described in his thesis which observes partial protocols of the behavior 

of a program. The protocols provide evidence about the different directions 
that program branches can take, with different data, and the system con¬ 
structs proposed programs that are minimal models in the sense that they 
contain just enough structure to account for the behavior* In effect, 
the system is an abstract learning program that can duplicate behavior that 
it observes, creating as small a program as It can find (In the sense of 
doing as much as possible by means of loops)* This Is the kind of process 
that would be appropriate, for example. In a system to learn the grammar 
embodied in some program, by discovering that certain groups of words 
all have the same effect on the program's branching and goes on to 
discover what certain kinds of phrases aTl have similar roles In causing 
that branching, 

3. Ira Goldstein Is developing a program capable of understanding 
very simple programs such as might be written by a beginning student* 
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A typical kind of task that this program will perform In Its elementary 
stages is to recognise the partial equivalence of the two LOGO programs 
PT and 92 r the verbal descriptions P3 and P4 and a rough drawing of a 
square: 

pi in n 

1 FORWARD 100 

2 RIGHT 00 

3 FORWARD 100 

4 RIGHT 90 

5 FORWARD 100 

6 RIGHT 90 

7 FORWARD TOO 

n TO 77 

1 FORWARD 100 

2 RIGHT 90 

3 77 

P3 Draw a line* then a right angle, then another line 
the same lengths a right angle, another line* 
another right angle, another line. 

P4 Draw a Tine, turn a right angle and keep repeating. 
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3. He have an extensive research program > funded by the HSF* to inves¬ 
tigate how children describe* develop and think about their own 
prccesses. 

(f) Mathematics and Math lab 

In cooperation with the groups in Project MAC * we pl an to 
continue developing relevant background theories in schemata* 
complexity, and applied mathematics. 

(g) ITS .and Programming 

We are very interested in decentralizing our use of computers, 
Some- ideas for doing this even for control of robotics are 
mentioned In the mini-robotics proposal* The situation is 
more straightforward in decentralizing programs of a purely 
symbolic Mud and we will attempt to develop some of our new 
projects through network access to other computers provided that 
arrangements can be made to guarantee appropriate service. 

(h) Visitors 

Our faculty will be reinforced by lung visits from K. Slum 
(Berkeley), V. Shirai {Electrotechnical Laboratory, Japan) 
and possibly M. Rabin {Israel). 
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Sample Dialog w| ch WToograd’s Language Understanding System 

The Blocks World was designed by Terry WI nog rad tn serve- as a 
subject domain for the development of a language understanding system. 
This system is radically nara ambitious than earlier attempts to deal 
with natural language In several respects. It is as much,, or more, 
concerned with the problem of representing the meanings In a discourse 
than with the grammatical structure? Wlnograd's thesis asserts that the 
interactions between meanings and sysntactle structures ere so Intimate 
that It is a bad Idea to try to drew a definite boundary between the 
procedures that "parse 111 a sentence and the procedures that are to be used 
to drew logical Inferences about the meanings carried In the discourse. 

Wlppgrad's system is a large, Intricate system, it Is described In his 
thesis at several levels of detail and It is impossible to do justice to 
It (n any short summary, Our strategy will be first to present an 
example of Its performance In the blocks world context, and then to 
discuss some of the mechanisms that are Involved In attaining thTs 
performance. In the following dialog the system carries out actions and 
answers questions about a simple scene contaInina a table, hand, and box, 
and several blocks and pyramids, as shown oiwpage -S. The lines In lower 
case were typed In by a person, and the lines in upper case are the 
computer's response. Indented lines are comments describing some of the 
features being exhibited. 

The dialog was carried out In real time with the response of the 
"robot” displayed on a CRT screen. Figures are Included to show the 
state of the scene at various points. 

1. pick up a big red block, 

....... ok. 

The system answers "OK" when It carries out a command. 
In order to pick up the red block. It had to dear ft 
off by finding a space for the green one and moving 
the green one away, 

Insert Figure on pige 3 about here. 


2. grasp the pyramid. 

■■«*« 1 DON'T UNDERSTAND WHICH PYRAMID YOU MEAN, 

There are three pyramids In the scene, so the program 
does nothing, 

3, find a block which is taller than the one you are holding and put It 
Into the box. 

The system has to figure out what is meant by "one" 
and "It", as well as using deduction to decide which 
block It Is holding* 

= =«..=.* UV "IT", I ASSUME YOU MEAN THE BLOCK WHICH IS TALLER THAN THE ONE 
1 AM HOLDING. 




Printed out to keep us Informed* 

.. OK. 

The hand has to set down the red block to pick up the 
taller blue one. Figure 4 shows the result of the 
system's manipulations so far. 

insert Figure it about here. 

L. what does the bo# contain? 

«""« THE BLUE PYRAMID AND THE BLUE BLOCK* 

5* what Is the pyramid supported by? 

Earlier the system responded to "the pyramid" by 
saying r, l don't understand which pyramid you mean." 
now. Ft assumes that we mean the one Just mentioned.. 

the box* 

6* how many blocks are not In the bos? 

Negatives such as ''rsot ir and "no 1 ' are handled whether 
they appear In clauses or noun groups (such as "no 
boxes") * 

—FOUR OF THEM* 

? + Is at least one of them narrower than the one which I told you to 

pick up? 

The phrase "at least one of them" must be analyzed to 
refer back to the previous Question, and "the one 
which*. must also refer to blocks* Noun groups can 
have complicated numbers such as "at least one", "more 
than three", "exactly seven", etc* The memory of past 
events Is used to deduce which block Is "the one which 
I told you to pick up"* Notice the complicated 
grammatical structure In which "the one" Is actually 
the object of a clause embedded two levels deeper* 
«**■»« YES # THE RED CUBE. 

6, Is it supported? 

Passives can be used without a "by ..." phrase. 

YES, BY THE TABLE* 

9* can the table pick up blocks? 

n = = =jra= WQ , 

The semantic rules make this a nonsensical question 
which therefore must be false* 

ID. can a pyramid be supported by a block? 

YES, 

The deductive system finds an actual example, so It 
knows this Is possible. 
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11. can a pyramid support a pyramid? 

* — ] DON'T KNOW* 

The question makes sense* but there Is no example fn 
the scene* so there Is no easy way to tali without 
being able to examine the robot's programs, 

12. stack up two pyramids, 

™ [ CAfl'T* 

The robot tries and fat Is* 

1J. the blue pyramid |& mine. 

I UNDERSTAND. 

This data Ts added to the data base. 

Hi. I own blocks which are not red* but I don't own anything which 
Supports a pyramid. 

------- I UNDERSTAND. 

Two new theorems are created for proving things about 
"own 1 ng ri t 

15. do I own tha box? 

-**•«* MO. 

The box supports whatever Pt contains, Including a 
pyramid. 

IS, do I own anything In the box? 

yes* TWO THI MGS: THE FLUE FLOCK AND THE FLUE PYRAMID. 

IT. win you please stack up both of the red blocks and either a green 
Cuba or a pyramid? 

....... OK, 

Logical connectives such as "and 11 "or M "either” etc* 
are handled fn both the grammar and semantics, A 
command Is executed even If It looks like a question* 
The command Is carried out by the following series of 
steps; It puts a green cube on the large red block 
(note that it chooses the green cube with nothing on 
ft)* thin removes the small pyramid from the little 
red cube, so that It can use that cube to complete the 
stack as specified, resulting In the situation of 
Figure 5. 

Insert Figure 5 about here* 

15. which cube Is sitting on the table? 

«•««* THE LARGE GREEN ONE WHICH SUPPORTS 

THE RED PYRAMID, 

Ordinarily the system would be satisfied with calling 
this "a big green cube"* but since we asked "which 
tube” It has to be more specific In naming It. The 
use of substitute nouns (such as "one"? and pronouns 
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In answers makes the discourse more natural. 

13, Fs there a large block behind a pyramid? 

*=*==== yes, THREE OF THEM: A LARGE RED ONE, 

A LARGE GREEN CUBE AND THE BLUE ONE* 

20, put a small one onto the green cube which supports a pyramid, 

"A small one" must contrast with the earlier phrase "a 
large block", so this means "a small block". 

........ Q K, 

21, put the 11ttlast pyramid on top of It. 

words like 11 II tt lest" are not In the dictionary but 
are Interpreted from the root forms like "little". 

OK. 

22, how many things are on top of green cubes? 

r'M NOT SURE WHAT YOU MEAN BY "ON TOP OF" IN THE PHRASE "OH TOP 
OF GREEN CUBES ", 

00 YOU MEAN: 

- 1 * DIRECTLY ON THE SURFACE 
I * ANWHERE ON TOP OF ? 

When there are several Interpretations and none Is 
clearly batter, the system has to ask the user for 
more Information. The paraphrases were Included with 
the dictionary definition of "on", and the request for 
more Information occurs automatically whan the 
heuristics for resolving ambiguities aren’t 

sufficient. A request like this can also occur when 
the meaning of a pronoun such as "It" or "they" Is 
unci ear. 


2 mrnmmmm THREE Of THEM. 

23. had you touched any pyramid before you put the green one on the 


11 ttle cube? 

....... YES, THE GREEN ONE. 

The system deduces from Its memory what event Is being 
referred to, and can relate other events to It In 
time. The analysis Includes complex tenses of verb 
grouos, and there are special facilities for keeping 
track Of the various forms of Irregular verbs like 
"have". 

24, when did you pick It up? 

....... WHILE F WAS STACKING UP THE RED CUBE, A LARGE RED SLOCK AND A 

LARGE GftEEN CUBE. 

A time Is described by naming the main goal which was 
belng worked on* 
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25* why? 

The system remembers TimnedTstely previous sentences co 
understand questions 11ke this and the following ones* 
«-*««« TO GET RID OF IT. 

By keeping track of selected parts of the original 
subgoal tree, the system has some understanding of Its 
own mo tt ves♦ 

26* why did you do that? 

TQ CLEAN OFF THE RED CUBE- 

"That" refers to the act Too mentioned In the answer* 

27* why did you clear off that cube? 

Here, "that" means the one just referred to* 
to PUT IT ON A LARGE GREEN CU&E* 

23* why did you do that? 

TO STACK OP THE RED CUBE , A LARGE RED BLOCK AND A LARGE GREEN 

CUBE. 

29* why did you do that? 

—. BECAUSE V00 ASKED ME TO* 

St remembers what Tt did, not how the request was 
worded * 

30* how did you do It? 

"It** refers to the action mentioned In the last 
question, rather than the answer* 

«•*•••■ SY PUTTING A LARGE RED 9LOCK ON THE TABLE } THEN LETTING GO OF 
IT ; THEN PUTTING A LARGE GREEN CUBE ON IT ; THEN LETTING GO OF THAT 
CUBE } THEN PUTTING THE RED CUBE ON THAT CUBE ; THEN LETTING GO OF THAT 
CUBE* 

This Is a description of the tdp-levaT subgoal* used 
to achieve the goal * 

31* how many objects did you touch while you were doing ft? 

Time references can refer to a time span, using words 
like "while 11 * 

■■■»■■* FOUR OF THEM* 

32, what did the red cube support before you started to clean. It off? 

*.- THE GREEN PYRAMID* 

By remembering what has bean moved. It is. possible to 
deduce things about earlier state* of the scene, We 
can use complex nestings of clauses like "before you 
started to clean It off"* 

33, there were five blocks to the left of the box then, 

"Then 11, must refer to the time mentioned In the last 
sentence, 
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=**■"■* mo, ONLY FOUR OF THEM! THE RED CUBE, TWO UPGE GfiEEN CUSPS AND A 

LARGE RED RLOCK, 

Earlier In the dialog, information about "owning" was 
accented at face value, Sut the system knows that It 
has complete Information about about the basic 
properties of objects, Including location, It checks 
an/ statements about location or other such properties 
to see whether it agrees or disagrees. 

34, put the blue pyramid on the block fn the bos. 

This (5 syntactically ambiguous, but is understood 
without ambiguity since the parsing corresponding to 
''put the blue pyramid which is on the block In the 
box," does pot make sense in the current scene, white 
"put the blue pyramid on the block which Is in the 
box" 1 does, 

------ OK* 

35* Is there anything which Is bigger than every pyramid but Is not as 

wide as the thing that supports it? 

The system understands quantifiers such as ’’every", 
"any", "'some 11 ', etc,, and structures such as "bigger 
than", "as wide as", etc. Hare "It” refers to a noun 
group within the same sentence but two levels up In 
the parsing tree. 

....... YES, THf SLUE SLOCK, 

36, does a steeple 

SORRY, I DON’T KNOW THE WORD "STEEPLE”, 

The System interrupts immediately when It sees a word 
It can 1 t figure out, 

37+ a "steeple" Is a stack which contains two green cubes and a pyramid. 

The quote marks Indicate that we are defining a new 
word, 

....... | UNDERSTAND. 

38, are there any steeples row? 

....... no. 

The new definition is used as a description to 
recognize steeples, 

39, build one. 

....... OK , 

The description given in the definition is enough to 
deduce a plan for building new objects as well as 
recognizing them. The word "one” means "a steeple", 
and the pish Is carried out. First, one of the Targe 
green cubes Is cleared off by removing the red cube 






PAGE 7 


and two pyramids* Then the two green cubes and the 
tall red pyramid are stacked up to fit the 
descript Eon, 


40* call the 

«■*«» OK. 


41* have you 
..YES, 


biggest block "superblock". 

We can also give objects names, 
picked up superblock since we began? 


42 * why did you drop It? 

— - BY "IT", | ASSUME YOU WEAN THE SUPERBLOCK, 

------- \ CAN'T EXPLAIN A NON-EXISTENT EVENT, 

There is no record in memory of any block being 
dropped* 


43, Ij there anything to the right of the red pyramid? 

------- YES, FOUR THINGS: THE SOX , THE RED CUBE, THE BLUE PYRAMID AND 

THE SLOCK HAH ED SUPERB LOCK 

Once en object has a name* the name Is used to help 
identify it In responses. 


*44, thank you* 

YOU'RE WELCOME! 

Idioms can be defined without modifying the grammar. 
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Figure 4 
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Flgure 5 
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jBnilci end, 
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k7' thank you, 
YOU T RE WELCOMEl 


The special fdlom-definlng capabilities allow 
us to Include such pleasantries without 
modifying the grammar •' 




•\ 


